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(57) ABSTRACT 

A flow control system and methodology controls and avoids 
congestion in an Asynchronous Transfer Mode (ATM) net- 
work. The congestion state of a network switch is deter- 
mined based on incoming Available Bit Rate (ABR) traffic, 
Constant Bit Rate (CBR) traffic, and Variable Bit Rate 
(VBR) traffic. High-frequency fluctuations due to VBR 
traffic are filtered out of the CBR/VBR traffic to determine 
the underlying trend of network traffic. A filtering mecha- 
nism is based on wavelet transforms. The switch congestion 
state is determined based on the current ABR traffic, the 
filtered CBR/VBR traffic, and the current status of buffers in 
the switch. The congestion state can then be used to adjust 
the ABR rate according to a feedback scheme such as 
Explicit Forward Congestion Indication (EFCI). 
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ABR FLOW CONTROL USING SINGLE BIT 
CONGESTION INDICATION AND WAVELET 
TRANSFORM FILTERING 

BACKGROUND OF THE INVENTION 

The present invention relates to congestion control and 
avoidance in a communications network and, more 
particularly, to controlling and avoiding congestion in an 
Asynchronous Transfer Mode (ATM) network using a 
wavelet-based filtering mechanism in an Available Bit Rate 
(ABR) Explicit Forward Congestion Indication (EFCI) flow 
control solution. 

The ATM Forum Traffic Management Specification, Ver- 
sion 4.0, 1996 (TM4.0), currently defines five service classes 
to support the diverse requirements of multimedia traffic: 
Constant Bit Rate (CBR), Real-Time Variable Bit Rate 
(RT-VBR), Non-Real-Time Variable Bit Rate (NRT-VBR), 
Available Bit Rate (ABR), and Unspecified Bit Rate (UBR). 
Both CBR and RT-VBR service classes provide stringent 
cell transmission delay (CTD), cell delay variation (CDV), 
and cell loss ratio (CLR) constraints. NRT-VBR service 
provides stringent CID and CLR constraints but no CDV 
guarantees. CBR, RT-VBR, and NRT-VBR all rely on the 
network reservation of resources to provide a required 
quality of service (QoS). 

ABR and UBR service classes are defined for applications 
with unpredictable bandwidth requirements, that are sensi- 
tive to cell loss, but are able to tolerate a certain amount of 
delay. ABR and UBR service classes are designed so that 
applications in these classes can grab any unused network 
resources that VBR and CBR traffic does not utilize, i.e., 
unused bandwidth and buffer space. Gains due to statistical 
resource utilization, however, come at the risk of potential 
congestion when many applications compete for network 
resources. Therefore, proper congestion control must be in 
place to ensure that network resources can be shared in a fair 
manner and that performance objectives such as cell loss 
ratio can be maintained. 

UBR service is truly "best-effort/' and thus the network 
provides no service guarantees for UBR. ABR, however, 
provides some service guarantees. Therefore, ABR must 
comply with the flow control framework specified in TM4.0, 
which supports several types of feedback to control the 
source rate in response to changing transfer characteristics. 
This feedback information is conveyed to the source, which 
adapts its traffic in accordance with the feedback. The 
feedback information includes the state of congestion and a 
fair share of the available bandwidth according to a network- 
specific allocation policy. To ensure interoperability, an 
ABR end system must always implement the source and 
destination behavior defined in TM4.0. In particular, the cell 
rate from an ABR source is bounded by a Peak Cell Rate 
(PCR) and Minimum Cell Rate (MCR) and must abide by a 
Cell Delay Variation Tolerance (CDVT). 

The ABR congestion control scheme specified in TM4.0 
is a rate-based, closed-loop, per-connection control that 
utilizes the feedback information from the network to regu- 
late the rate of cell transmission at the source. FIG. 1 
illustrates the basic operation of ABR congestion control. In 
network 100, source 102 transmits data cells 108 to desti- 
nation 104 via one or more switches in network 100, one of 
which is shown as switch 106 in FIG. 1. Source 102 also 
generates special probe cells 110 referred to as resource 
management (RM) cells in proportion to its current data cell 
rate. Destination 104 turns around and sends RM cells 110 
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back to source 102 in the other direction. The RM cells, 
which can be examined and modified by the switches in both 
forward and backward directions, carry feedback informa- 
tion of the state of congestion and the fair rate allocation. 

5 The following summarizes the operation of the rate-based 
control scheme, the details of which, along with the detailed 
RM cell format, are found in TM4.0, which is incorporated 
herein by reference. 
At switch shall implement at least one of the following 

10 methods to control congestion: 

a) Explicit Forward Congestion Indication (EFCI) mark- 
ing: The ATM switch may set the EFCI state in the data 
cell headers. Most first-generation ATM switches 
implemented this mechanism even before the RM cell 

15 was fully defined. 

b) Relative rate marking: The ATM switch may set t he 
congestion indication (CI) bit or the no increase (NI) bit 
in the forward and/or backward RM cells. 

c) Explicit rate marking: The ATM switch may reduce the 
20 explicit rate (ER) field of forward and/or backward RM 

cells. 

Switches that implement EFCI marking and relative rate 
marking are known as binary switches, which are simple to 
implement but may result in unfairness, congestion 

25 oscillation, and slow congestion response. Switches that 
implement explicit rate marking are generally referred to as 
ER switches and require sophisticated mechanisms at the 
switches to compute a fair share of the bandwidth. The 
TM4.0 standard-defined source and destination behaviors, 

30 however, allow interoperation of all three congestion control 
options according to the following rules. 

Once the source has received permission, it begins sched- 
uling cells for transmission at the allowed cell rate (ACR). 
The ACR is initially set to the initial cell rate (ICR) and is 

35 always bounded between the minimum cell rate (MCR) and 
the peak cell rate (PCR) specified by the application at call 
setup. An RM cell precedes transmission of any data cells. 
The source continues to send RM cells, typically after every 
Nrm data cells, where Nrm is the maximum number of data 

40 cells a source may send for each forward RM cell. The 
source places the ACR value in the current cell rate (CCR) 
field of the RM cell, and i:the rate at which it wishes to 
transmit cells (usually the PCR value) in the ER field. The 
RM cells traverse forward through the network, and the 

45 destination turns the RM cells around in the backward 
direction. Intermediate switches on the path notify the 
source of congestion by marking the EFCI bit in the data 
cells and the CI or NI bit in the RM cell, and/or reducing the 
ER value in the RM cells. 

so Upon receiving the RM cell in return, the source should 
adapt its ACR to the information carried in the RM cell If 
the CI bit is not set, the source may linearly increase its ACR 
by a fixed increment (RIF* PCR), where the rate increase 
factor (RIF) is determined at call setup. This increase can 

55 reach the ER value in the RM cell, but should never exceed 
the PCR. If the CI bit is set, the source must exponentially 
decrease its ACR by an amount greater than or equal to a 
proportion of its current ACR (RDF* ACR), where the rate 
decrease factor (RDF) is also determined at call setup. The 

60 factors RIF and RDF control the rate at which the sources 
increase and decrease, respectively. 

If the ACR is still greater than the returned ER, the source 
must further decrease its ACR to the returned ER value, 
although it should never be below the MCR. If the NI bit is 

65 set, the source should observe the CI and ER fields in the 
RM cell, but it is not allowed to increase the ACR above its 
current value. 
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In EFCI marking binary feedback control schemes, the Operation of a rate-thresholding scheme that monitors 
ATM switch sets the EFCI bit in the AIM cell header during low-pass filtered cell arrival rates can be described math- 
congestion. As described above, end systems use this one-bit ematically as follows. Let C denote the link capacity and p r 
information contained in each cell to increase/decrease their denote a targeted utilization (e.g., 0.9) so that pjC specifies 
cell rates by an incremental amount. This basic mechanism 5 sorrje prese t rate threshold. Let r A (t) denote the total mea- 
imposes very small implementation burdens on switches and sured ABR traffic rate and let r^(t) denote the total measured 
requires minimal processing. Several mechanisms exist for background or guaranteed traffic (i.e., CBR and VBR) rate 

the switch to determine the onset of congestion, including . t - 0 • . , # . * t ; /A * - / A iU „ A 

the following EFCI schemes & at time interval t; let r A (t) and r^t) represent the correspond- 

Many first generation ATM switches implemented binary ft ^ low-pass filtered rates. A switch implementing the rate 

schemes based on a simple first-in-firsl-out (FIFO) queuing 10 folding scheme determines that congestion is imminent 

mechanism shared by all ABR connections. Under this basic at ^ interval x ™ d the EFCI blt when me agg^gate 

EFCI scheme, a switch determines the onset of congestion filtered rates exceed me rate threshold: 

at time interval t when the queue level q(t) in one of the r A (t)+? B (t)> ?7 c-*EFO. bit (data cell)-i 
switch's buffers exceeds some preset threshold q^ shown in 

FIG. 2 A: 15 By removing the high-frequency fluctuations, filtering 

improves stability and reduces oscillations. However, to 

g(0^?r— EFCI bit(data «ll)-i. ensure stability, the rate-thresholding scheme should be 

In some implementations of this scheme, the switch combined with the basic queue fill thresholding scheme, i.e., 

declares the onset of congestion if the instantaneous queue ^ EFCI bit should also be set when a queue threshold is 

size exceeds a threshold q^-^ shown in FIG. 2B, and detects 20 exceeded. ),sing low-pass filtered rates as a congestion 

a normal "uncongested" state when the queue size drops indicator improves performance by detecting congestion 

below another threshold q ;ow „ also shown in FIG. 2B. changes more rapidly. Depending on the burstiness of the 

This basic binary EFCI scheme, which detects nodal network traffic, p r may be tuned to values closer to 1. 
congestion by comparing the present queue size with a A larger value of p T decreases the safety margin of slack 
preset threshold, has several disadvantages. First, congestion 25 bandwidth that may be used to accommodate sudden bursts 
detection is delayed by the amount of time required to build in traffic. Hence, network operators that operate their net- 
up the queue. Second, the resolution of congestion is also works with a high p r run a higher risk of violating the 
delayed by the amount of time required to drain the queue. service requirements of some connections. The appropriate 
Such delayed detection followed by the unnecessary exten- utilization target of any given link depends on the charac- 
sion of congestion periods significantly increases the oscil- 30 teristics of the traffic flowing through it. If each source's rate 
lation period as well as the oscillation magnitude of ABR is small compared to link capacity (small grain size) and 
traffic within the network, causing a large consumption of bursts are short, the link's utilization target can be set higher, 
buffer resources. Because both the queue levels and the ABR Bursty sources with large, long bursts or long idle periods 
source rates oscillate continually, large buffers are required require a lower link utilization target, 
to maximize network utilization. 35 In addition to detecting congestion early and reducing 

A second binary EFCI scheme monitors the rate of cells oscillations in ABR traffic, the low-pass filtered rate- 
coming into the queue, R in FIG. 3, relative to a threshold thresholding scheme results in a substantially shorter time 
on a short interval basis. The threshold may be defined, for period of congestion than the basic queue-thresholding 
example, in terms of a targeted link utilization, e.g., 0.9C, scheme. Because of improved congestion detection 
where C is the link capacity. If R exceeds a predefined 40 performance, the buffer capacity required at each switching 
threshold, the switch declares congestion. In a simple imple- node is substantially reduced. 

mentation of this scheme, the switch declares congestion if A third binary EFCI scheme, illustrated in FIG. 4, moni- 

the count of queue input cells exceeds T^, in an interval of tors the change in queue size Q to detect the onset of 

7^. This scheme improves upon the first scheme by imple- congestion. In this scheme, the first difference of queue 

menting congestion avoidance, i.e., by controlling the 45 depth, AQ/At is used as the congestion indicator, allowing 

incoming rate, the switch avoids the onset of congestion tight control of the queue size. 

rather than detecting congestion after it occurs. The main advantage of existing binary feedback schemes 

In Y. Zhao, S. Q. Li, and S. Sigarto, "An Improved EFCI is their implementation simplicity. The first-generation ATM 

Scheme with Early Congestion Detection/' ATM Forum switches implement a simple FIFO queuing mechanism and 

Contribution AF-TM96-1084, February, 1996, the authors 50 mark the EFCI bit in the event of congestion. However, the 

proposed a variant of the rate-thresholding scheme called proper tuning of the rate increase and decrease parameters 

EFCI with Early Congestion Detection (EFCI-ECD). To (RIF and RDF) and the queue or rate threshold (q r or p r ) is 

improve network resource utilization, this scheme detects required. The maximum queue length and the achieved 

congestion by monitoring the low-pass filtered arrival rates. utilization depend on the proper tuning of all the parameters. 

Recent multimedia queuing analyses (S. Q. Li and C. 55 Specification of Oj. and p r yields vastly different buffer 

Hwang, "Queue Response to Input Correlation Functions: requirements and loads depending on the selection of RIF 

Continuous Spectral Analysis," JEEE/ACM Trans. and RDF. 

Networking, Vol. 1, No. 6, December, 1993, pp. 678-692; S. The main disadvantage of the existing binary feedback 

Q. Li, S. Chong, and C. Hwang, "Link Capacity Allocation schemes is that they are not fair in a multiple-oode network, 

and Network Control by Filtered Input Rate in High Speed 60 The schemes may cause some connections to have unfair 

Networks," IEEE/ACM Trans. Networking, Vol 3, No. 1, access to the available bandwidth depending on the network 

February, 1995, pp. 10-15) indicate that queue congestion is topology and end system behavior. It has been observed that 

mainly captured by low-frequency traffic behavior. Filtering sources which have several hops on their routes are more 

of the VBR traffic, for instance, eliminates the high- likely to have an EFCI bit set to 1 than sources with few 

frequency fluctuations which are easily absorbed by buffer- 65 hops. As a result, it is unlikely that long-hop connections 

ing; ABR rates should not be adjusted based on these will be able to increase their rates, and consequently they 

short-term VBR bursts. will be beaten down by short-hop connections. This has been 
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referred to as the beat-down problem (J. Bennet and G. T. be absorbed by the finite buffer capacity at each node. 

Des Jardins, "Comments on the July PRCA Rate Control However, many filtering techniques known or used today are 

Baseline," .4 7M Forum Contribution AF-TM 94-0682, July, generally not suitable for multimedia applications. For 

1994). example, in Li et al. and S. Chong, S. Q. Li, and J. Ghosh, 

In A. Arulambalam, X. Chen, and N. Ansari, "Allocating 5 "Dynamic Bandwidth Allocation for Efficient Transport of 

Fair Rates for Available Bit Rate Service in ATM Real-Time VBR Video over ATM/' INFOCOAT94, 1994, 

Networks," IEEE Communication Magazine, November, pp. 81-90, the authors use a Finite Impulse Response (FIR) 

1996, pp. 92-100, the authors propose several alternatives filter with a very sharp cut-off frequency. This filtering 

for alleviating the unfairness problem in binary feedback approach has certain limitations because the cut-off fre- 

schemes. The first alternative is to provide each connection 10 quency has to be known for each traffic stream. Although 

or group of connections with separate queues. Such a this approach may be applicable to video servers where the 

per-connection queuing mechanism makes it possible to filtered input rate can be precalculated and used at connec- 

determine the congestion status of a connection in isolation tion setup time, it is difficult to predetermine what the cut-off 

from the rest of the connections. Because only those con- frequency should be for an aggregate traffic stream resulting 

nections that cause the, congestion of their associated 15 from the multiplexing or concentration of different traffic 

queues are subject to congestion marking, the beat-down streams. Furthermore, simple filtering at a certain cut-off 

problem in the binary scheme can be eliminated. Thus, this frequency is not sufficient because there is no notion of 

isolation ensures fair access to buffer space and bandwidth intensity of the frequency involved. The approach adopted 

among all competing connections. by Zhao et al., in which the filtering function is implemented 

This per-connection queuing approach also enjoys several 20 by a simple short-term moving average operation, is also 
other distinct benefits. First, per-connection queuing can be unsuitable for multimedia connections because the moving 
used to help protect individual connections effectively since average filter cannot handle nonstationary signals or signals 
a misbehaving connection can, only cause its own queue to with significant transient components, 
overflow. Second, it allows the delay and the loss behavior Many traffic filtering mechanisms are designed in an ad 
of individual connections to be decoupled from each other, 25 hoc manner. Because the arrival and service rates in net- 
leading to better QoS control. Finally, per-connection infor- works are time-varying, it is crucial to estimate network 
mation is readily available to help the implementation of parameters by an adaptive procedure that takes changes into 
other types of congestion control, such as early packet account. Furthermore, since the estimator must be imple- 
discard (EPD) and partial packet discard (PPD) mecha- mented in real-time, its computational and space complexity 
nisms. 30 should be as small as possible. Time weighted moving 

Although per-connection queuing offers considerable averages (e.g., exponentially weighted moving average 

advantages over single FIFO queuing, this comes at the (EWMA)) have these properties and are often used in 

significant cost of switch design and implementation. Effi- practice. However, appropriate filter weights are difficult to 

cient buffer management and scheduling among the queues determine and are usually chosen heuristically. 

can be costly to design and implement. Such implementation 35 It is desirable, therefore, to provide a binary feedback 

may be prohibitive when the number of connections grows control scheme that monitors the queue input rate relative to 

substantially large. a threshold. It is more desirable to provide such a scheme 

The second feasible alternative is to provide selective or that operates on low-pass filtered traffic so that ABR rates 

intelligent marking on a common FIFO queuing mechanism are not modified based on fluctuations due to unfiltered VBR 

when congestion takes place. In this alternative, a switch 40 traffic. It is even more desirable to provide a filtering process 

computes a fair share and a connection is asked to decrease that is simple al d inexpensive to implement and is effective 

its rate by marking the EFCI bit only if its current cell rate in filtering multimedia application traffic. Finally, it is desir- 

is above the computed fair share. In this approach, cells able to provide a binary feedback control scheme that 

belonging to virtual circuits are selectively marked based on complies with existing standards, 

their current cell rates which are carried in the RM cells. In 45 OT nn , A _ rr ^ T ^ T _ vr 

addidontoEFamarkmg,CIandNIbitsintheRMcellscan SUMMARY OF THE INVENTION 

be used to regulate the rate at the source. This invention satisfies those desires by providing a flow 

Although per-connection queuing and intelligent marking control solution using a wavelet transform-based filtering 

provide good solutions for the beat-down problem, they do mechanism. 

not address the other problems of binary feedback schemes, so A system consistent with the present invention is for use 

In particular, as has been described, binary feedback in controlling congestion in a network with a network switch 

schemes are generally slow in response to congestion having a first buffer for receiving packets of a first type into 

because it may take several round-trips to alleviate conges- a first queue and a second buffer for receiving packets of a 

tion or utilize any unused bandwidth. Another problem is second type into a second queue. The system includes a first 

that binary schemes may generate large oscillations even at 55 measurement module coupled to the first buffer for measur- 

steady-state, making it difficult to achieve very high link ing an arrival rate of the packets of the first type, a second 

utilization. The best solution for improving stability and measurement module coupled to the second buffer for 

reducing oscillations in the ABR, cell rates and queue levels measuring an arrival rate of the packets of the second type, 

in a binary scheme is to implement a low-pass filtered and a flow control module coupled to the first and second 

rate-thresholding scheme that filters input traffic by remov- 60 measurement modules and the second buffer for determining 

ing the high-frequency fluctuations in the arrival process. a congestion state of the switch based on the arrival rates of 

The multimedia queuing analyses by Li et al. observe that the first and second type packets and the queue length of the 

in a finite-buffer system, the traffic arrival process is essen- second queue. The first measurement module includes a 

tially characterized by the low-frequency components, the sampler for sampling the arrivals of the first type packets and 

effective capacity at each node is essentially captured by the 65 a wavelet transform processor coupled to the sampler for 

low-frequency input traffic filtered at a properly selected determining an effective arrival rate of the first type packets 

cut-off frequency, and the high-frequency components can in response to the sampled arrivals. 
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Methods are also provided for carrying out the systems effective ABR rate (C a ) and queue status (q^) to detect 

consistent with the present invention. congestion and outputs a decision related to flow control, 

Consistent with the present invention, the first type pack- e -g > & c EFCI bit should be set to 1 because congestion has 
ets comprise CBR and VBR packets, and the second type been detected. Consistent with the present invention, band- 
packets comprise ABR packets. However, it should be 5 width measurement module 154 and flow control module 
appreciated by one skilled in the art that systems and 156 may be implemented as software stored in a memory in 
methods consistent with the present invention have a poten- ^ c ^™ switch and executed by a processor in the ATM 
tially broader application to flow control in other packet switch. The memory may be any type of computer-readable 
networks. medium, such as any electronic, magnetic, or optical read/ 

The advantages accruing to the present invention are 10 wnte stora S e device, 
numerous. Methods and systems consistent with the present As explained above, it is desirable to measure the band- 
invention provide an efficient and effective binary ABR flow width of incoming traffic in such a way that high-frequency 
control solution. The solution results in smaller oscillations components due to VBR traflic are filtered out. By extracting 
in ABR rates, and therefore smaller buffer requirements in the underlying low-frequency traflic profile, bandwidth mea- 
network switches. 15 surement module 154 provides flow control module 156 

Hie above desires, other desires, features, and advantages J 1 more sta ^ le effective bandwidth measurement on 

of the present invention will be readily appreciated by one wtucn 10 base a flow contro1 decislon - 

of ordinary skill in the art from the following detailed 11 is also desirable that the filtering mechanism be adap- 

description of the preferred implementations when taken in 2Q ^ve and computationally simple. Time weighted moving 

connection with the accompanying drawings. average schemes have these properties, but it is difficult to 

choose correct filtering weights for such schemes, 

BRIEF DESCRIPTION OF THE DRAWINGS Therefore, an alternative signal analysis method is needed to 

. . .„ . « « . - effectively filter the high-frequency components. 

FIG. 1 is a diagram illustrating the basic operation of ABR _ . - . , , . , . „ _ 

congestion control' or ^ 0ne exam pj e 0 f a S1 g na i analysis tool is the Fourier 

' Transform (FT), which has become a cornerstone of modern 

FIGS. 2A-B illustrate an EFCI congestion control scheme data aDa i ysis . The pr trans i at es a function in the time 

with simple queue fill thresholding; domain ^ a in the implex (real imaginary) 

FIG. 3 illustrates an EFCI congestion control and avoid- frequency domain, where it can be analyzed for its fre- 

ance scheme with queue input rate thresholding; 3Q quency content. The FT describes the original function in 

FIG. 4 illustrates an EFCI congestion control scheme with terms of orthogonal basis functions of sines and cosines of 

queue growth rate thresholding; infinite duration. The Fourier coefficients of the transformed 

FIG. 5 is a high-level block diagram of an ABR flow function represent the contribution of the sines and cosines 

control system consistent with the present invention; at each frequency. The FT is most commonly used in the 

err, s -n t , i c i w t i_ c j r>e f° rm °f the Discrete Fourier Transform (DFT), which ana- 

FIG. 6 illustrates an algorithm for calculating the forward 35 , v .™r , , 

fast wavelet transform; ^ ****** sam P led Ume ™* DFT works ™ der 

„ .„ . the assumption that the original time domain function is 

FIG. 7 illustrates an algorithm for calculating the inverse periodic in nature . :As a result? the DFT nas difficulty with 

fast wavelet transform; functions that have transient components, that is, compo- 

FIG. 8 illustrates components of an ABR flow control ^ nents which are localized in time. This is especially apparent 

system consistent with the present invention that includes a when a signal has sharp transitions. Another problem is that 

wavelet-based traffic filtering mechanism; the DFT of a signal does not convey any information 

FIG. 9 is a flowchart of a method for calculating effective pertaining to translation of the signal in time other than the 

capacity consistent with the present invention; phase of each Fourier coefficient. The phase values are 

FIG. 10 illustrates an example of a method for calculating 45 averaged for the input function. Applications that use the 

effective capacity consistent with the present invention; DFT oflen work around the first problem by windowing the 

FIG. 11 illustrates a timing diagram associated with a m ff da ' a mat tne ^mpled values converge to zero at the 

memodforcalcmaungeffectivecapacityconsistentvviththe Ai *fV* to .* e ^ cond P roble ?> suc *!^ e 

present invention* development of the short-time Fourier transform (STFI), 

_ * . c „ have met with marginal success. 

FIGS. 12A-B illustrate event diagrams associated with a 50 

method for calculating effective capacity consistent with the Wavelet Functions 

present invention; and T , , . „ 

TTrn n 11 * * Ann a .1 . ■ . < In recent years, new families of orthogonal basis func- 

HG 13 illustrates an ABR flow control system consistent ^ hayc becn that lcad * trmhms that 

with the present invention. 55 ovcrcomc ^ problems: of the DFT basis junctions 

DETAILED DESCRIPTION OF THE ™ ^ WaVe * ets ' ^ ^ * e m6 £ sine wa ^ e 

PREFERRED EMBODIMENTS ° f the ™} necd <° haVC ^ tc ^7 ca ?L bc 

non-zero tor only a small range of the wavelet function. This 

FIG. 5 illustrates, at a high level, an ABR flow control "compact support" allows the wavelet transform (WT) to 

system consistent with the present invention. The system, 60 transform a time domain function into a representation that 

shown generally by reference numeral ISO, operates within is not only localized in frequency (like the FT) but in time 

a network switch, such as an ATM switch. Incoming CBR/ as well. With the frequency content evolving with time for 

VBR cells arrive at the switch and enter buffer 152. Band- most real-time signals (including traffic in computer 

width measurement module 154 determines the effective networks), time-frequency analysis using wavelets has 

bandwidth utilized by current CBR/VBR traffic based on 65 become a more powerful tool. 

information received about cell arrivals. Flow control mod- The fundamentals of wavelet functions will now be 

ule 156 uses the effective CBR/VBR bandwidth and the described to the extent they pertain to the present invention. 



05/19/2004, EAST Version: 1.4.1 



US 6,584,111 Bl 

9 10 

Details can be found in S. G. Mallat, "A Theory for 

Multiresolution Signal Decomposition: The Wavelet j (7) 

Representation," IEEE Traru Pattern Analysis and Mack. g(0 = £ c <^ ,w + Z Z 

Intelligence, Vol. 11, No. 7, July, 1989, pp. 674-693, and 0. teZ jsl * cZ 
Rioul and M. Vetterli, "Wavelets and Signal Processing," 5 

IEEE SP Magazine, October, 1991, pp. 14-38, both of which j^e expansion coefficients c, represent the approximation 

are incorporated herein by reference. 0 f me original signal g(t) with a resolution of one point per 

There are two fundamental equations upon which wavelet every 2' points of the original signal. The expansion coef- 

calculations are based. These are the scaling function (also ficients d,^ represent the details of the original signal at 

called the dilation equation or fundamental recursion) and 10 different levels of resolution. These coefficients completely 

the primary wavelet function (mother wavelet): and uniquely describe the original signal and can be used in 

ways similar to the Fourier transform. The Wavelet 

<t>(r)-2 4 ^ A 4>(2(-/c); ip(0-S*«z(-i*)«*-i«£f+i). (1) Transform, then, is the process of determining the values of 

. ™ • «. * c ■ t j ,i w Oj and d /Jt for a given g(f) and wavelet system, 

where Z is toe set of integers and the a* are the wavelet 15 * ^ ^^rim equation naturally leads to a recursive 

coefficients. Both of these functions are two-scale difference fa ^ wavekt tnnafom tf some a^ptions are 

equations and are the prototypes of a class of orthonormal ^ ^ ^ g(t) {& taken M a m of 

basis functions of the form: discretc vohi% y samplcd at 2 ™ points pcr unit mleival 

^t)^(i J t-k) ; j t kez^t)=2^t-k); hkZ. (2) P oints can bc vicwe d as the inner product of $ and 

20 g(t). That is, the sample points are an approximation, or c, 

The parameter j controls the dilation or compression of coefficients, of the continuous function g(t). This allows the 

the function in time scale as well as in amplitude. The c, and d Jfk terms to be calculated by convolution of the 

parameter k controls the translation of the function in time. samples of g(t) with the wavelet coefficients a A . Daubechies 

The set ofbasis functions formed by <(>(t) and \p(t) is a system has discovered that the WT can be implemented with a 

of scaled and translated wavelets. 25 specially designed pair of finite impulse response (FIR) 

Wavelet systems can be either real or complex-valued, filters called a quadrature mirror filter (QMF) pair. A FIR 

though most research has used real-valued wavelet systems. filter performs the dot product (or sum of products) between 

Wavelet systems may or may not have compact support, the filter coefficients and the discrete data samples. The act 

Wavelets have compact support if and only if they have a of passing a set of discrete samples, representing a signal, 

finite number of non-zero coefficients. Since compact sup- 30 through a FIR filter is a discrete convolution of the signal 

port is what gives wavelets the ability to localize in both with the coefficients of the filter. 

time and frequency, only wavelets of that type will be The outputs of the QMF filter pair are decimated (or 
described. down-sampled) by a factor of two, that is, every other output 
Several techniques have been used to create wavelet: sample of the filter is kept, the others being discarded. The 
systems. These include cubic splines, complex exponentials, 35 low-frequency (low-pass) filter output is fed into another 
and parameter space constructions. Parameter space con- identical QMF filter pair. As illustrated in FIG. 6, this 
structions have been used to construct wavelets which are operation can be repeated recursively as a tree or pyramid 
orthonormal bases. The Haar wavelet (the first wavelet algorithm, yielding a group of signals that divide the spec- 
discovered) and the Daubechies wavelets belong to this trum of the original signal into octave bands with succes- 
class. Orthonormal wavelets allow perfect reconstruction of 40 sively coarser measurements in time as the width of each 
a function from its wavelet transform coefficients by the spectral band narrows and decreases with frequency, 
inverse WT. The following set of conditions must be satis- Mallat has shown that the tree or pyramid algorithm can 
fied before a set of coefficients can represent an orthonormal be applied to the wavelet transform by using the wavelet 
wavelet: coefficients as the filter coefficients of the QMF filter pairs. 

45 The same wavelet coefficients are used in both the low-pass 

^a 2A = l/V2~; ^aat+i = i/vT; ^ and high-pass filters. The low-pass filter coefficients are 

*«z *«2 associated with the a* of the scaling function <|>. The output 

V atcn y - 0 for / * 0- V aro* - J °^ ea °k l° w "P ass fil ter * s °i> or approximation components, 

+2/ ~ of the original signal for that level of the tree. The high-pass 

so filter is associated with the a k of the wavelet function xjj. The 
output of each high-pass filter is the 6 Jk , or detail 

where a* is the complex conjugate of a^. components, of the original signal at resolution 2 J . The c, of 

Once a wavelet system is created, it can be used to expand the previous level are used to generate the new c, and d /Jt for 

a function g(t) in terms of the basis functions: the next level of the tree. The decimation by two corresponds 

55 to the multi-resolution nature (the j parameter) of the scaling 

« ~ ~ (4) and wavelet functions. 

g{t) ~ ^ Cim *^^J J ** jm The inverse DWT essentially performs the operations 

" ~ J associated with the DWT in the opposite direction. The 

inverse transform coefficients are combined to reconstruct 

with the coefficients calculated by the inner product as: 60 the original signal. The same a A coefficients are used in the 

forward transform, except the ordering of the coefficients is 

crM0lfiM*(0*/(O* (5) reversed. As illustrated in FIG. 7, the process works up the 

^^iMihl&NiAW* C 6 ) branches of the tree combining the approximation and detail 

signals into approximation signals with higher levels of 

If the wavelet system has compact support and an upper 65 detail. Instead of decimation, the signals are interpolated, 

limit J is placed upon the degree of dilation j, then the that is, zeroes are placed between each approximation and 

expansion equation becomes: detail sample and the new signals are passed through the 
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low-pass and high-pass filters, respectively. The intermedi- 
ate zero values are replaced by "estimates" derived from the 
convolutions. The outputs of the filters are then summed to 
form the approximation coefficients for the next higher level 
of resolution. The final set of approximation coefficients at 5 
the top of the tree in the reverse transform is a reconstruction 
of the original signal data points. II 

An added advantage of the DWT is that it is more 
computationally efficient than the FFT. An FFT of length N 
(where N is an integral power of 2) requires 0(Nlog 2 (N)) 10 
operations. A DWT of length N requires only O(N) opera- 
tions. The FIR filters are linear processing elements. Only 
half as many operations are performed on each level of 
resolution in the tree algorithm as in the previous level. 

Because data traffic in computer networks is generally 15 
nonstationary (i.e., varying with time) and possesses some 
form of statistical self-similar (also known as fractal) 
behavior, analysis using wavelet transforms is well-suited to 
such traffic. Self-similarity in networking traffic manifests 
itself in burstiness across a wide spectrum of time scales. 20 
Self-similar traffic models have been created that can cap- 
ture correlation structures observed in networking traffic. On 
the other hand, the mathematical analyses of these models 
are often extremely difficult. The wavelet transform enables 
analysis of the traffic without assumptions with respect to 25 
either the stationarity of the data or the statistical distribution 
embodied in the data. These features are retained because 
the WT is a linear operator that preserves the time localiza- 
tion of the data in the transform coefficients at the different 
levels of scale. 30 

Discrete Wavelet Transform (DWT) Algorithms 

Methods for signal decomposition and signal reconstruc- 
tion using discrete wavelet transforms to be used in filtering ^ 
methods consistent with the present invention will now be 
presented. The DWT may be computed recursively as a 
series of convolutions and decimations. At each scale level 
j an input sequence S'^n) is fed into a low-pass and 
high-pass filter, H(z) and G(z), respectively. The output from ^ 
the high-pass filter G(z) represents the detail information in 
the original signal at the given scale j, denoted by W(n). The 
output of the low pass filter H(z) represents the remaining 
(coarse) information in the original signal, and is denoted by 
S^o). H(z) and G(z) are the z-transforms of the set of 
coefficients representing the scaling function <|>(t) and the 
wavelet function \|>(t), respectively. 

The sequential DWT algorithm may be expressed as 



5''(n)=2 J ^''- 1 (A)ff(2fi-A), 



the filters, at scale j, are valid DWT coefficients, and L/2-1 
outputs are discarded. This assumption is by no means 
limiting as far as the architecture is concerned. It is always 
possible to derive different arrays with similar characteris- 
tics given any other solution to the edge effects problem. 

The following pseudo-code represents the DWT algo- 
rithm for signal decomposition in nested loop form, with 
adjusted limits accounting for edge effects. The input is 
X(n)=S°(n)> S>(n) and W'(n) correspond to the detail and 
coarse sequences shown in equations (8) and (9). The index 
space of the algorithm is 



/ = 



1 £k z2n iornzLfl 
2n-L+lzk sin farn> L/2 



(10) 



45 



(S) 
(9) 



50 



At the output of every level there is a downsampling 
operation which appears as the factor two in the argument of 
the filter coefficients. 55 

One problem associated with the DWT of finite duration 
signals occurs at the boundaries of the input sequence and is 
referred to as the edge effects problem. This problem occurs 
in any transformation such as a DWT that uses convolution 
on finite length signals. Edge effects are a consequence of 60 
the output of an FIR filter being L-l samples longer than the 
input sequence. Although several edge effects solutions have 
been proposed, such as reflecting the end points of the input 
or periodically extending them, solutions tend to be 
application-specific. In order to clearly specify the behavior 65 
of the algorithm at the edges of the index space, the 
following assumption is made. The first M/2? outputs from 



Without loss of generality, the harmless assumptions are 
made that M and L are powers of 2 and that all indices start 
from 1. M=2 m and J are the length of the original signal 
vector and the maximum number of levels for signal 
transformation, respectively, 
procedure SIGNAL_DECOMP [X(n), H, G, M, J] 
X(n)=input signal vector 

H=low-pass wavelet filter coefficients for signal decom- 
position 

G=bigh-pass wavelet filter coefficients for signal decom- 
position 

1. S°(n)=X(n), Sf(n)-0, Vj, Vn 

2. for jol to J 

3. for n-1 to M/2f 

4. for k-max (1, 2n-L+l) to 2n 

5. S'tn^S'W+S'- 1 ^) H(2n-k) 

6. W'^W'^+S^Oc) G(2n-k) 

7. endfor(k) 

8. endfor(n) 

9. endforQ) 

10. output-S^n), W'(n), j-1, 2, . . . , J 

Similarly, the inverse DWT can be used for signal recon- 
struction. In methods consistent with the present invention, 
as will be detailed below, only partial reconstruction of the 
signal is required to determine the high-frequency ("noise") 
component of the signal. Therefore, the following pseudo- 
code represents an algorithm for reconstruction of only the 
high-frequency component. 

procedure PARTIAL_RECONS [W'(n), j=l, 2, . . . , scale, 
ft, 6, scale] 

(This procedure computes the high-frequency ("noise") 
component at level scale. The reconstruction is done 
from level scale to level 1, where scaled 1.) 

ft-low-pass wavelet filter coefficients for signal recon- 
struction 

G=high-pass wavelet filter coefficients for signal recon- 
struction 

W"(n), j-1, 2, . . . , scale-output of high-pass filter. (W*(n) 

is the highest frequency sub-band) 
M=2 w «length of original signal 

1. for jascale to 1 

2. for n=l to M/2 0 '-^ 
S*'- l (n)=0 
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3. for A(t), t=0, 1, 2, 3, . . . : Equidistant traffic measurements 

(cell counts) 

* = nu*(l.[^Ul) t or?l n " 0 ' 1 ' 2 '- •= Window number 

2 2 Low-frequencies in window n 

5 H„(t): High-frequencies in window n 

S~>'-\n)-S*>'-\n)+W(k) 6(n-2 (k-1)) C„(t): Wavelet coefficients (window n) 

if (j<scale) C*_(t): Modified wavelet coefficients (window n) 

S^(n)^S^(n)4S^00 fi(n-2 (k-1)) ^ 2 m. Window size 

endfor(k) 10 Ca2 * k>0: Caicjation rate 

endfor(n) ^ 1? 2 , . . . , m: Iteration (search) level 

endfor(i) B: Buffer size (in cells) 

4. output-S (n), iv-1, 2, . . . , M & buffer utilization> 0<?<1 

Bandwidth Measurement A(w+nc-l): Right border of window n 

Referring again to FIG. 5, a flow control system consis- A ( nc ) : ^ border of window n 
tent with the present invention includes bandwidth measure- ^ original traffic measurement curve of a window is 
ment module 154 and flow control module 156. As s P ht mt0 a low-frequency component L and a high- 
explained, it is desirable that bandwidth measurement mod- frequency component H according to 
ule 154 generate an effective capacity of CBR and VBR 2 o 

traffic that has had high-frequency components filtered out «e^£L; = + HnJ ^ 

in order to provide a stable input to flow control module 156, 

For the reasons described, traffic filtering mechanism based 

on wavelet analysis is effective for network traffic. where 

FIG. 8 illustrates some components of bandwidth mea- 25 

surement module 154 consistent with the present invention, f*^ 1 \ < 12 ) 

including a wavelet-based traffic filtering mechanism. This «ri![S«J 2^ "njWpffl* 
architecture is related to the wavelet-based connection 
admission control mechanism developed by P. Droz in 

"Traffic Estimation and Resource Allocation in ATM 30 is the break criterion between L and H. That is, to separate 

Networks," Diss ETH Zurich, No. 11462, 1996 (found at the high-frequency component H from the low- frequency 

http:HAvww.zurich.ibm.com/-dro/) ("Droz I"), and "Esti- component L, the method searches for the maximum H that 

mation and Resource Allocation Based on Periodical Wave- fulfills equation (12), For window n at iteration level j, the 

let Analysis" 4th UK/Australian International Symposium left hand side of equation (12) is determined by executing 

on DSP for Communication Systems, September, 1996 35 the following search criterion on the vector of high- 

("Droz 11"), which are both incorporated herein by refer- frequency components [ij i=l, 2, . . . , 2 m , where 

ence. max_dev is the maximum deviation of noise from zero and 

Sampler 160 generates periodic measurements of the ^ determined by procedure new_cumsum [H^"*)], illus- 

incoming CBR/VBR traffic arriving at buffer 166. These *ated by the following pseudo-code: 

measurements consist of cell counts divided by At, where At 40 procedure new__cumsum [x(M)] 

is the sampling interval. The sampling frequency depends on (x is a vector with M elements and x(l) as the first 

the link speed and the buffer size. Sampler 160 feeds its element) 

output into wavelet transform-based processor 164. In peri- Noise-0 

odic intervals, processor 164 analyzes the samples and max dev=0 

calculates the effective capacity. Buffer manager 162 checks 45 for i=l to M 

the queue length in buffer 166 and signals processor 164 if if (Noise+x(i)<0) 

the queue length exceeds a certain threshold or if cell loss Noise=0 

occurs. Buffer manager 162 also signals processor 162 if a else { 

certain period of time passes without cell loss. Processor 162 Noise«Noise+x(i) 

responds to these signals by adjusting the traffic filtering 50 max__dev=max(max__dev, Noise) 

process. } 

Effective Capacity endfor 

^ 3 output max_dev 

Consistent with the present invention, processor 164 In the above procedure, the accumulant is never set below 

implements a method for calculating the measurement- 55 zero because in reality the available capacity in the past 

based effective capacity using wavelet-based filtering. A cannot be accumulated for use in the future. This is a 

method consistent with the present invention contains modi- modification of the cumsum(x) function provided in Droz I 

fications and enhancements to the algorithm given in Droz and Droz II, which is defined as the cumulative sum of the 

I. In general, sampled traffic is separated into signal and elements of x. That is, the ith element of cumsum(x) is 

noise, i.e., a low-frequency component and a high-frequency 60 I^/x,, where x y is the jth element of x. cumsum(x) carries 

component. By subtracting the high-frequency component over negative values in its computation, while new_ 

from the sampled traffic measurements, the method obtains cumsum(x) does not. 

the filtered (low-frequency) component of the original sig- The method searches iteratively for the strongest noise 

nal. The new effective capacity is the maximum of the that still fulfills equation (12) using wavelet transforms, 

filtered signal during the previous measured interval. 6 5 First, the wavelet transform W[ ] is executed to obtain the 

A method for calculating effective capacity consistent wavelet coefficients up to m levels of decomposition for 

with the present invention uses the following parameters: measurement window n: 
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C„(s)=W[A(s)fyc£sov+TK (13) which sets new_coeffs to the coefficients obtained by the 

wavelet transform, and then to step 214, which resets the 

Next, the method begins an iterative process using the lower ^ coefficients to zero . i n step 216, the noise is 

inverse wavelet transform W[ ], with j-0, 1, % . . . , m obtained by an mverse wavelet transform using the modified 

denoting the iteration level: 5 coefficients. Thus, in each iteration one more level becomes 

„ , . tjv / s-, ^ /. A part of the vector of high frequencies and all the lower levels 

*jv jja \ j m sct tQ Kr0i ^ov/ p roC eeds to step 218, in which a 

with cumulative sum function is applied to the noise, yielding a 

sequence of values, the maximum of which is the maximum 

c* flJ (j)={o if nc%s<nc+T n - i c n {i) ) otherwise (is) io deviation from zero. This value represents the maximum 

filling of the buffer during the observed interval. In step 220, 

After each back-transformation, the maximum criterion in the method determines whether the maximum deviation 

equation (12) is checked to find the most intensive but still from zero has surpassed the product of the buffer utilization 

acceptable noise. In each iteration one more scale of coef- and buffer size, i.e., gB. If max_dev has surpassed £B, 

ficients is counted to the high-frequency component, 15 found is set to TRUE (step 224), flow returns to step 208, 

whereas the rest of the coefficients are set to zero. Thus, the and then to steps 226-228 to calculate the effective capacity, 

method searches for the largest j that still fulfills the maxi- If max_dev has not surpassed £B, high_JL_old is set to 

mum criterion. The two extreme cases are j-0 and j-m. For high_f and flow returns to step 208. In step 208, if there arc 

j-0, all traffic is counted as low-frequency, whereas, for j-m, more levels of decomposition, flow proceeds to steps 

everything is counted as high-frequency. The estimate of the 20 210-220 to continue searching for the highest noise meeting 

effective capacity C, is then the maximum of the low- the break criterion. The break criterion is such that the 

frequency component during the window: maximum filling of the buffer is below the buffer size 

multiplied by the buffer utilization factor. If, on the other 

mu L„(s), ncZsKw+nc (16) hm ^ ^ inverse wave let transform has included every level 

which can be used for control purposes. If the wavelet 25 of wavelet decompositi^ 

processor uses cell counts directlyf the effective capacity (in ste f *° determine the effective capacity, 

cells/second) is given by the output of the processor (in cell *° Step 2 K 26 ' ^k^"** ™£ r 15 com P* ted « 

counts) divided by the sampling interval At (in seconds). ^f™ the ve f tor / ^ measurements and 

FIG. 9 illustrates a flowchart of amethod consistent with old ^"frequency ^vector. For the case in which step 224 

the present invention for calculating the effective capacity 30 ^^f^T* ™ & u?u?£ 

using wavelet transforms. The method uses the following the f d °^que^y vector is tfie last one ^suchthat^B 

parameters' was surpassed. For the case in which step 224 was never 

- „ , „. . reached because all levels of decomposition were analyzed 
coeffs: vector of wavelet coefficients surpassing the old high-frequency vector is set 
high_f: vector of high frequencies 35 to the current high-freqnency vector in step 222. In step 228, 
low_f: vector of low frequencies the effective capacity c is computed as the maximum value 
X: vector of traffic measurements m ^ low-frequency vector. 

l. 1 „ r , 1 . a- - 4 /c • 1 That is, the method obtains the filtered (low-frequency) 

h: low-pass wavelet coeflicients (for signal \ - it . . , . , , , v . , H , J) 

decomposition* component or the original signal by subtracting the noise 

- . . - „ . 40 (high-frequency) component from the traffic measurements, 
g: high-pass wavelet coefficients (for signal The effective capacity for the nextperiod is then taken as the 

decomposition) maximum of the filtered signal during the previously mea- 
se ale: current level of wavelet decomposition sured interval. 

nr__of_scales: number of levels of wavelet decomposi- The following pseudo-code represents the computation of 

tion 45 the effective capacity C e illustrated by the flowchart of FIG. 

rh: low-pass wavelet filter coefficients (for signal 9 ; ^ procedure calls the signal decomposition and partial 

reconstruction) signal reconstruction procedures described earlier. 

rg: high-pass wavelet filter coefficients (for signal f^™? EFFECTIVE^CAP [X(n), M, H, G, ft, G, J, 

reconstruction) buffer size buffer_ut,hzation] 

/ so 1. found=FALSE 

max_dev: maximum deviation of noise from zero 

reset: set coarse and detail coefficients of level scale+1 

and lower all to zero T 

f a ttj t TUf ■( - u- i_ jl. , In P ut signal vector, X(n), n-1, 2, .... M, M-2 W , J^m, 

found: TRUE if ma»mum high-frequency component m is any positive imeger 

sausfymgequahon(12) is found, FALSE otherwise ss 2 [s , (n) . 2, . . , ^-procedure SIGNAL 

After lnitiahzmg found to FALSE (step 200), scale to 0 ncrnMP TvL\ a A \a n ~ 

(step 202) and high_JL_old to 0 (step 204), the wavelet 3 ^Me Ko2 && sSe^ { 

transformation is executed to obtain the vector of wavelet 111 

coefficients coefls. The inputs to the wavelet transformation ^q^" 50 

are the traffic measurements (X), the wavelet low- and 60 S (n)°Procedure PARTIAL„RECONS [W(n), j-1, 

high-pass filter coefficients (h and g), and the number of 2, . . . , scale, H, G, scale] 

levels of decomposition (nr_of_scales). max_dev=procedure new__cumsum[S ,0 (n)] 

The signal/noise separation is done in the loop (steps if (max„dev>buffer__size*buffer_utilization) { 

208-224). In step 208, if equation (12) is not satisfied found-TRUE 

(foundoFALSE) and the level of decomposition is less than 65 scale =scale-l 

the total levels of decomposition, flow proceeds to step 210, } 

which increases the level by one. Fbw proceeds to step 212, } endwhile 



scale=0 

J=number of scales 
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4. Effective_capacity, C tf =0 With different wavelet bases, it is possible to influence the 
if (scale==0) { speed of the noise detection and separation from the signal. 

for n=l to M Droz I suggests that the above algorithm can be improved by 

C e =max (C e , X(n)) performing the wavelet transformation analysis incremen- 

endfor(n) 5 tally in the loop. Pipelined implementation of the, transfor- 

} else { mation is also possible, as well as parallel implementations. 

S"°(n)-procedure PARTI AL_RECONS [W'(n), j-1, 2, The most efficient way, however, is to interpret the coeffi- 

. . . , scale, H, G, scale] cients directly on the different scales. Such an interpretation 

for n=l to M is strongly influenced by the chosen wavelet basis. 

C>max(C„X(n)-S») 10 

endfor(n) Samphng of Cell Arrivals 

5. Convert C e to Mbps. If X(n) is given in cell counts, then , R 5£™« a f ^ to 8, bandwidth measurement mod- 
C, inMbps=(C e in cell counts* 53* 8ysampling interval At ^ 154 mcl * d f ^^I^^J pC Il odlC mea " 
in seconds) surements of the incoming CBR/VBR traffic. These mea- 
FIG. 10 illustrates an example implementation of the 15 surements consist of cell counts divided by At, where At is 

signal/noise separation for the calculation of the effective the sampling interval. Tlie samphng frequency depends on 

capacity detailed in FIG. 9 using a 3-level wavelet decom- the link speed and the buffer size. Sampler 160 feeds its 

position. Steps indicated in FIG. 10 correspond to those in output into wavelet transform-based processor 164. In peri- 

FIG. 9, and not all steps are shown in 110 for the sake of odic intervals, processor 164 analyzes the samples and 

clarity. First, a wavelet transform is performed on the input 20 calculates the effective capacity, as detailed above. A sam- 

vector X (step 206), yielding the coarse wavelet coefficient pling method consistent with the present invention is based 

c3 and the detail coefficients dl, d2, and d3. Next, scale is on one given in Droz I. 

set to 1 (step 210), and the reset function sets all coefficients ^ sampling freque ncy is given by the maximum 

below level 1 (i.e., all except dl) to zero (step 214). An ^ ^ (of of me &M) Qn a 155 ^ ^ a 

^f^'^u F**^j^^ ^ ^^ij^fp^iP v first vector 25 ^ ^ ^ £ 2 ?35 mk / roseconds at £ u load 

of nigh frequencies, nil (step 216). If the maximum devia- 0 t J , . t , 

r • r * * *\. * L j *- ,i Sampling at maximum speed can yield only two different 

tion of noise from zero is greater than the product of the * i ^ « . ' . i4 , . 4 , 

, rr i V c . j 4 , , ff .-i re 4 - measurement values: either a cell has been transmitted in the 

buffer utilization factor and the buffer size, the effective , 4 . 4 . 4 „, , - 

. 4l _ , c 4 . . . ' v , . slot or it has not. Thus, when sampling at maximum speed 

capacity is the maximum value of the input vector X (step . A . 4 A ' * i . f « 

, r _ 4 . r ,, « it is not possible to observe any mean value, only noise, lb 

228). The high-frequency vector is not subtracted because 30 ~ r . 17 . ' . u 

7 . ■ j i_ aL ^ * i 1 r perform a signal/noise separation where the noise is 

the search criterion was surpassed by even the first level of r L . j ■ iL « « iL i« c iL , 

, T - . j * *j r • t„ absorbed in the buffer, the sampling frequency must be high 

decomposition. If the maximum deviation of noise from « *• f L « !«• l *. *. 

. r . . 4U - . f . ~ enough to capture a potential buffer filling between two 

zero is not greater than the product of the buffer utilization \T ,\ r - iL . & , . „ 

c * j iu u cc - . samphng points. Therefore, the maximum number of cells 

factor and the buffer size, processing continues at the a. * • ■ *u *■ • * 1 u *. u, 

, . . , o 1 * • j* <->/* iia\ j _ that arrive m the time interval between the two measurement 

beginning of the loop. Scale is increased to 2 (step 210), and 35 . . , , , , , - „ 4U u , 

, , ° . * ii a= • * i_ 1 points must be below the buffer capacity. On the other hand, 

the reset function sets all coefficients below level 2 to zero f . , . L1 4 . , . j * ■! j • r • 4 .< 

/ * ii j\ a 1 * * r _c j it is desirable to obtain more detailed intormation about the 

(step 214). Again, an inverse wavelet transform is performed n • i r v .uul 

r-u 7 ^, . ... . , « 5 . cell arnval process. Therefore, the samplmg rate should be 

to yield a second vector of high frequencies, nf2 (step 216). r^- r ^ L rc • I * i i_ 

T -; u . j . .. * . 1_ • . *i_ a fraction of the buffer size to capture a potential buffer 

If the maximum deviation of noise from zero is greater than EWfa between two sam lin oints 

the product of the buffer utilization factor and the buffer size, 40 ^ P S P • 

the effective capacity is the maximum value of the input Hie samphng interval At must be chosen sufficiently large 

vector X minus the first high-frequency vector, hfl (step t0 obtain m accurate rate measurement and allow imple- 

228). If the maximum deviation of noise from zero is not mentation; too large an interval, however, will lead to 

greater than the product of the buffer utilization factor and sluggish response. The measured rate is a multiple of the rate 

the buffer size, processing continues at the beginning of the 45 rcsolution ^ me interval At must be selected so that 

loop. Scale is increased to 3 (step 210), and the reset measurement of a solitary cell will not result in the rate 

function sets all coefficients below level 3 (only the coarse bem S kr B e compared to the link capacity. As a rough 

coefficient) to zero (step 214). Again, an inverse wavelet guideline, At should be sufficiently large to produce a rate 

transform is performed to yield a third vector of high resoluuonof 0.5% of the target rate, Pl C, for instance. Thus, 

frequencies, hf3 (step 216). If the maximum deviation of 50 At ^ 0}li6 be chosen to be approximately l/(0.005 Pr C). 

noise from zero is greater than the product of the buffer For a 155 Mbps link and a 1000-cell buffer, a sample once 

utilization factor and the buffer size, the effective capacity is about every millisecond is a good value; it equals a maxi- 

the maximum value of the input vector X minus the second mum °f -*65 
high-frequency vector, hf2 (step 228). If the maximum 

deviation of noise from zero is not greater than the product 55 ( 155 Mbpsxio -3 ] 

of the buffer utilization factor and the buffer size, the |s bit&/bytex53 bytes/cell]' 
effective capacity is the maximum value of the input vector 
X minus the third high-frequency vector, hf3 (step 228). 

An advantage in using the wavelet filtering technique which is about x h of a frequently used buffer size in the range 
described is the ability to include system parameters such as 60 of 1000 cells. Higher sampling frequencies are possible but 
buffer utilization targets in the filtering process, which is not they increase the number of samples in the window, 
possible with the moving average or exponentially weighted As illustrated in FIG. 11, the effective capacity is con- 
moving average techniques. One guideline for selecting |B dnuously calculated using wavelet transforms applied on 
is so that the target buffer utilization level does not exceed partially overlapping windows. The overlap is needed to 
a maximum permissible delay. That is, ^B<d max9 where d max 65 capture trends across the borders of conservative windows, 
is a maximum permissible delay. For example, £B can be and the amount of overlap can depend on the extent of 
chosen such that £B^3ms. adaptability expected in the measuring process. Of course, 
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the specific settings that are appropriate in any deployment 
environment depend on the characteristics of that environ- 
ment (i.e., the traffic characteristics and link bandwidth), on 
how much ABR traffic a network operator wishes to admit 
on a link, and on the level of risk the network operator is 
willing to take that the service requirements will occasion- 
ally be violated. 

Buffer Manager 

Referring again to FIG. 8, bandwidth measurement mod- 
ule 154 includes buffer manager 162 for monitoring the 
queue in buffer 166 and signaling wavelet transform pro- 
cessor 164 accordingly, which responds to these signals by 
adjusting the traffic filtering process. For example, if the 
queue length in buffer 166 exceeds a threshold or if cell loss 
occurs, buffer manager 162 reduces the buffer utilization 
factor, 5, and passes the new factor to wavelet transform 
processor 164. This makes flow control schemes consistent 
with the present invention more conservative in the sense 
that less filtering of high-frequency components is done. 
Typically, the current buffer utilization factor might be 
multiplied by a factor of 0.8. After a certain period of time 
has passed without cells being lost, buffer manager 162 
increases the buffer utilization factor and passes this value to 
wavelet processor 164, causing more filtering of high- 
frequency components. A typical multiplication factor is 1 .1 , 
which increases the link utilization by 10%. In general, the 
reduction factor should be more drastic than the increase 
factor to avoid strong oscillations between cell loss and 
no -loss periods. 

In high-speed networks such as ATM networks, a cell loss 
rate of 10~ 8 or smaller is frequently requested. Therefore, 
before buffer utilization can be increased, at least 10 s cells 
must have been transmitted without losses. A higher value of 
approximately 10 10 cells can be used to decrease the chances 
of reverting back to a loss period. Thus, on a 155 Mbps link 
under full load, a period of 7.5 hours must pass before buffer 
utilization can be increased. For this reason it is preferable 
to engineer and fix the utilization on links with speeds below 
2 Gbps. Only on high-speed links should buffer manager 162 
signal wavelet transform processor 164 to adjust its filtering 
based directly on the cell loss rate because only on a 
high-speed link can this parameter be measured with good 
confidence in a reasonable time. On a 2.4 Gbps link at full 
load, for example, 10 10 cells are transmitted in a period less 
than 30 minutes, so the cell loss rate can be measured fairly 
accurately. The higher the link speed, the faster the cell loss 
rate can be measured accurately. 

Bandwidth Measurement Event Diagrams 

FIGS. 12A-B illustrate event diagrams for the wavelet- 
based traffic filtering performed by processor 164 in 
response to the traffic sampled by sampler 160. At is the 
sampling interval and Ac represents the computation time 
for obtaining the effective capacity C e . In FIG. 12A, At is 
greater than or equal to Ac. A first window begins at nc, and 
a second window begins at (n+l)c. The minimum possible 
difference between the start of the first and second window 
is At because each window must start at the beginning of a 
sampling interval, and the effective capacity must be 
obtained for the first window before the end of the second 
window. On the other hand, in FIG. 12B, At is less than Ac. 
Here, the second window cannot begin At after the first 
window because the effective capacity for the first window 
has not yet been obtained. Rather, the minimum shift 
between the first and second windows is 6 At, where 8=] Ac/ 
Atf, i.e., the next integer greater than or equal to Ac/At. 
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Flow Control 

FIG. 13 illustrates an ABR flow control system consistent 
with the present invention and shows the system illustrated 

5 in FIGS. 5 and 8 in more detail. Cell arrivals are divided into 
CBR/VBR traffic and ABR traffic, each of which enters a 
queue in its own set of buffers, 166 and 170, respectively. 
The portions of the bandwidth measurement module per- 
taining to CBR/VBR traffic are sampler 160, buffer manager 

io 162, and wavelet transform processor 164, as were also 
shown in FIG. 8. The output of wavelet transform processor 
164 is the effective capacity, C„ of CBR/VBR cell arrivals, 
i.e., the low-pass filtered bandwidth obtained as described 
above. ABR arrival rates are also measured by short-term 

15 rate measurement module 172, whose output is the ABR 
rate, C a . In a system consistent with the present invention, 
measurement module 172 can measure ABR arrivals using 
simple short-term instantaneous rate measurements. 
Alternatively, measurement module 172 may implement an 

20 effective capacity estimator similar to the one used to 
measure CBR/VBR arrivals. 

Flow control module 156 uses the CBR/VBR effective 
capacity, C e , the ABR capacity, C a , and the ABR queue 
length to detect early congestion and outputs a decision 

25 related to flow control, e.g, the state of congestion of the 
switch. In a method consistent with the present invention, 
flow control module 156 detects congestion when the total 
traffic rate C e +C a exceeds a threshold, e.g., a percentage of 
the available capacity on the outgoing link or virtual path, or 

30 when a queue threshold is exceeded. For example, conges- 
tion is detected when 

C m +C.*r)Coi q^q, (17) 

35 where r\ is a congestion detection thresholding factor (e.g., 
r|=0.9) and C is the link or virtual path capacity. Upon 
detecting congestion, flow control module 156 indicates 
congestion by, erg., setting the EFCI bit to 1 in headers of 
data cells transmitted downstream. 
40 The following pseudo-code illustrates the operation of an 
EFCI flow control mechanism consistent with the present 
invention for a single FIFO ABR queue: 
procedure EFCl_CO NTROL (EFCI Scheme with Rate 
Threshold Congestion Detection) 
45 1. {Initialization} 

Initialize rate threshold factor (e.g., t|-0.9); 

Initialize ABR traffic queue threshold, queue 

THRESHOLD; 
50 2. {At every cycle (window size)ow)} 

Update rate threshold factor t| and queue_ 
THRESHOLD, if necessary; 

3. Receive a data cell from upstream node of virtual channel; 
if (C e +C a ^r\C) or (queue_LENGTH^queue_ 

55 THRESHOLD) 

EFClol; 

4. Send data cell to the downstream node. 

It will be appreciated by those skilled in this art that 
various modifications and variations can be made to the flow 

60 control mechanism consistent with the present invention 
described herein without departing from the spirit and scope 
of the invention. Other embodiments of the invention will 
bee apparent to those skilled in this art from consideration of 
the specification and practice of the invention disclosed 

65 herein. It is intended that the specification and examples be 
considered exemplary only, with a true scope and spirit of 
the invention being indicated by the following claims. 
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We claim: 11. A method for controlling congestion in a network, the 

1. A system for use in controlling congestion in a network, network including a network switch having a first buffer for 

the network including a network switch having a first buffer receiving packets of a first type into a first queue and a 

for receiving packets of a first type into a first queue and a second buffer for receiving packets of a second type into a 

second buffer for receiving packets of a second type into a 5 sec ond qucuc having a queue length, the method comprising 

second queue having a queue length, the system comprising: me steps 0 f . 

a first measurement module coupled to the first buffer for . . , t f ,, « . , tu c , . 

_ ■ . , . ^fitJl u * p*u a ** measuring an arrival rate of the packets of the first type, 

measuring an arrival rate or the packets of the first type, . , ,. * 4l _ c 

■ I j« r 3r ' including the steps of 

a satnpkr for sampling the arrivals of the first type in sampling the arrivals of the first type packets; and 

packets- and determining an effective arrival rate of the first type 

a wavelet 'transform processor coupled to the sampler packets in response to the sampled arrivals using 

for determining an effective arrival rate of the first wavelet transforms; 

type packets in response to the sampled arrivals; measuring an arrival rate of the packets of the second 

a second measurement module coupled to the second type; and 

buffer for measuring an arrival rate of the packets of the 15 determining a congestion state of the switch based on the 

second type; and arrival rates of the first and second type packets and the 

a flow control module, coupled to the first and second q Ueue length of the second queue, 

measurement modules and the second buffer, for deter- 12 , ih e method of claim 11 wherein the step of measuring 

mining a congestion state of the switch based on the me arriva3 rate of first type pac kets further comprises the 

arrival rates of the first and second type packets and the m steos 0 f 

queue length of the second queue. . . 

2. The system of claim 1 wherein the first measurement monitoring the length of the first queue; and 

module further comprises a buffer manager coupled to the generating a signal if the queue length exceeds a prede- 

first buffer and the wavelet transform processor, the buffer termined threshold; 

manager including 25 and wherein the step of determining the effective arrival rate 

means for monitoring the length of the first queue; and of the first type packets is further in response to the signal, 

means for outputting a signal if the queue length exceeds a 13. The method of claim 11 wherein the step of deter- 

predetermined threshold; mining ao effective arrival rate of the first type packets 

and wherein the wavelet transform processor determines includes the step of separating the sampled arrival rate into 

an,effective arrival rate further in response to the signal. 30 a high-frequency component and low-frequency component. 

3. The system of claim 1 wherein the wavelet transform 14 ^ method of claim u wherein the step 
processor includes means for separating the sampled arrival mcludes the step of implementing a discrete wavelet trans- 
rate into a high-frequency component and a low-frequency f orm 

TO1 ^P°? ent ' * r i ■ <* l • *i_ 15. The system of claim 13 wherein the separating step 

4. The system of claim 3 wherein the separating means 35 mdudes ^ f ^ mgh . fr ^ component 
mckides means for implementing a discrete wavelet trans- bafied 0Q a ^ predetemiined f hre J ld> 

T'The system of claim 3 wherein the separating means . Y^**?™?* * ^ 

includes means for isolating the high-trequency component " Step of *°}Ti J m g h " frec J UCDC y exponent 

based on a second predetermined threshold. 40 bascd on a mcasure of mc buffer size ' 

6. The system of claim 5 wherein the second predeter- 11 ' ^ s y stem of claim 15 wnerein me Elating step 
mined threshold is based on a measure of the buffer size. includes the step of implementing an inverse discrete wave- 

7. The system of claim 5 wherein the isolating means let ta 11 ^ 0 ™ 1 * 

includes means for implementing an inverse discrete wave- 18 - ^ system of claim 13 wherein the determining step 

let transform. 45 further includes the step of filtering the high-frequency 

8. The system of claim 3 wherein the wavelet transform component from the sampled arrival rate. 

processor further includes means for filtering the high- 19. Tliememod of claim U further comprising the step of 

frequency component from the sampled arrival rate, modifying a second type packet based on the switch con- 

J>. The system of claim 1 wherein the flow control module gestion state, 

modifies a second type packet based on the switch conges- so 20. The method of claim 11 wherein the network is an 

tion state. ATM network, the packets of a first type comprise CBR and 

10, The system of claim 1 wherein the network is an AIM VBR packets, and the packets of a second type comprise 

network, the packets of a first type comprise CBR and VBR ABR packets, 
packets, and the packets of a second type comprise ABR 

packets. * * * * * 
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